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Abstract 

It is often important to incorporating covariate information in the 
design of clinical trials. In literature, there are many designs of us- 
ing stratification and covariate-adaptive randomization to balance on 
certain known covariate. Recently Zhang, Hu, Cheung and Chan 
(2007) have proposed a family of covariate-adjusted response-adaptive 
(CARA) designs and studied their asymptotic properties. However, 
these CARA designs often have high variabilities. In this paper, we 
propose a new family of covariate-adjusted response-adaptive (CARA) 
designs. We show that the new designs have smaller variabilities and 
therefore more efficient. 



1 Introduction 

Response-adaptive designs for clinical trials incorporate sequentially accru- 
ing response data into future allocation probabilities. A major objective 
of response-adaptive designs in clinical trials is to minimize the number of 
patients that is assigned to the inferior treatment to a degree that still gener- 
ates useful statistical inferences. The preliminary idea of response adaptive 
randomization can be traced back to Thompson (1933) and Robbins (1952). 
A lot of response-adaptive designs have been proposed in literature {e.g., 
Rosenberger and Lachin 2002, Hu and Rosenberger, 2006). Much recent 
work has focused on proposing better randomized adaptive designs. The 
three main components for evaluating a response-adaptive design are allo- 
cation proportion, efficiency (power), and variability. The issue of efficiency 
or power was discussed by Hu and Rosenberger (2003), who showed that 
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the efficiency is a decreasing function of tfie variability induced by tfie ran- 
domization procedure for any given allocation proportion. Hu, Rosenberger 
and Zhang (2006) showed that there is an asymptotic lower bound on the 
variability of response-adaptive designs. A response-adaptive design that at- 
tains this lower bound will be said to be first order efficient. More recently, 
Hu, Zhang and He (2008) proposed a new family of efficient randomized 
adaptive designs that can adapt to any desired allocation proportion. But 
all these studies are limit to the designs that do not incorporate covariates. 

In many clinical trials (Pocock and Simon, 1975, Tavcs, 1974), covari- 
ate information is available and has a strong influence on the responses of 
patients. For instance, the efficacy of a hypertensive drug is related to a pa- 
tient's initial blood pressure and cholesterol level, whereas the effectiveness 
of a cancer treatment may depend on whether the patient is a smoker or a 
non-smoker. Covariatc-adaptive designs have been proposed to balance co- 
variates among treatment groups (see Pocock and Simon, 1975, Taves, 1974 
and Zclcn, 1974). Hu and Rosenberger (2006) defined a covariate- adjusted 
response-adaptive (CARA) design as a design that incorporate sequentially 
history information of accruing response data and covariate as well as the 
observed covariate information of the incoming patient into future allocation 
probabilities. 

In a CARA design, the assignment of a treatment depends on the history 
information and the covariate of the incoming patient. This generates a 
certain level of technical complexity for studying the properties of the design. 
Zhang, et al (2007) got a limit success on CARA designs by proposing a 
class of CARA designs that allow a wide spectrum of applications to very 
general statistical models and obtaining the asymptotic properties to provide 
a statistical basis for inferences after using this kind of designs. However, 
the CARA designs in Zhang, et al (2007) often have high variabilities and 
therefore are not efficient (Hu and Rosenberger, 2003). The major purpose 
of this paper is to study the variability and efficiency of CARA designs and 
to propose a new family of CARA designs with small variabilities. 

The paper is organized as follows. In Section 2, the Fisher information 
and the best asymptotic variability are derived for a CARA design with any 
given target allocation proportion. We will find that the Fisher information 
and the variability depend on the distribution of each individual response, 
the target function and the distribution of the covariate. In Section 3, we 
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propose a new CARA design that can adapt to target any allocation function 
and in which a parameter can be tuned such that the asymptotic variability 
approaches to the best one. The design proposed by Zhang, et al (2007) is 
a special case of this new design and has the largest variability in all this 
kind of designs. The new design is also an extension of the doubly adaptive 
biased coin design (BDCD) proposed by Eisele and Woodroofe (1995) and 
Hu and Zhang (2004a). The technical proofs are put on the Appendix. 

2 Variability and efficiency of CARA designs 

2.1 General framework of CARA designs. 

Given a clinical trial with K treatments. Supposing that a patient with a 
covariate vector ^ is assigned to treatment k, k = 1, . . . , K, and the observed 
response is Y^, assume that the response has a conditional distribution 
fkiVkWk,^) for given the covariate ^. Here Ok, k = 1, . . . ,K, are unknown 
parameters, and 0^ C M"^ is the parameter space of 6^- 

In an adaptive design, we let Xi,X2,.-. be the sequence of random 
treatment assignments. For the m-th subject, Xm = (Xm,!, ■ ■ ■ ,Xm,K) rep- 
resents the assignment of treatment such that if the m-th subject is allocated 
to treatment k, then all elements in are except for the A;-th component, 
Xjn,k: which is 1. Suppose that {1^,^, k = 1, . . . ,K, m = 1,2 . . .} denote 
the responses such that Y^^^ is the response of the m-th subject to treat- 
ment k, k = 1, . . . , K . In practical situations, only Y^^k with X^.k = 1 is 
observed. Denote = {Ym^i, ■ ■ ■ , Ym,K)- Also, assume that covariate infor- 
mation is available in the clinical study. Let be the covariate of the m-th 
subject. We assume that {(l^,i, . . . , Yjn^K, (,m), m = 1, 2, . . .} is a sequence 
of i.i.d. random vectors, the distributions of which are the same as that of 
(Fi, . . . , Yk, $.). Further, let JT^ = a(Xi, . . . , X^), "S^m = (t{Yi, . . . ,Ym) 
and 3frn = c(^i, . . . , ^m) be the sigma fields corresponding to the responses, 
assignments and covariates respectively, and let = (^{^mi'^m, ^m) 
be the sigma field of the history. A general covariate-adjusted response- 
adaptive (CARA) design is defined by 

V'm+l.fc =^{Xm+l,k = l|^m,^m+l) 

— ^i.-^m+l,k — l\'^rn-i'3^m,^rn+\)-i k = l,...,K, 
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the conditional probabilities of assigning treatments 1, ...,K to the mth pa- 
tient, conditioning on the entire history including the information of all pre- 
vious m assignments, responses, and covariate vectors, plus the information 
of the current patient's covariate vector. 

2.2 CARA designs with a target. 

Let A^m,fc be the number of subjects assigned to treatment k in the first 
m assignments and write = (-^m,i> • • • ,^m,K)- Then Nm = YlT^i -^i- 
Further, let N^^klx = J2m=i ^m,kl{$m = x} be the number of subjects 
with covariate x that is randomized to treatment k, k = 1,. . . ,K, in the 
n trials, and Nn{x) = Ylm=i ^{^^ — ^} be the total number of subjects 
with covariate x. Write = (Oi, . . . , Ok)- Because the value of 6 and the 
covariate determinate the distributions of the outcomes, and accordingly, 
the effects of each treatments, in many cases one would like to define a 
CARA design such that the "conditional" allocation proportion for a given 
covariate x converges to a pre-specified proportion which is a function of 
and X. That is, 



where ■7Ti{0,x), . . ., nxiO, x) are K known functions. We call them target al- 
location functions. Examples for the choice of target functions are discussed 
in Zhang, et al (2007), Rosenberger, et al (2001), Rosenberger, Vidyashankar 
and Agarwal (2001) and Hu and Rosenberger (2006). Recently, Tymofyeyev, 
Rosenberger and Hu (2007) developed a general framework to obtain opti- 
mal allocation proportion for i^-treatment clinical trials. However, when 
P(^ = x) = 0, for example, in the continuous covariate case, the "condi- 
tional" allocation proportion Nn^k\x/^nix) is not well-defined because both 
the numerator and denominator are zeros almost surely. As compared with 
(12. 1|) . it is more meaningful to allocate each individual patient to treatment 
k with a probability close to iTk{0, x) for a given covariate x. So we consider 
a class of CARA designs with a property that 



The next theorem tells us that ([221) implies I^B)- Write pk{9) = E7rfc(0,^), 

k = i,...,K, p{e) = {pi{e),...,pK{e))and7v{e,x) = {TTi{e,x),...,7rK{e,x)). 




TTk{6,x), k = l,... 



K. 



(2.1) 



P{Xjn+l^k = l\^m,$m+l = x) ^ TTk{0,x) a.S. 



(2.2) 
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Theorem 2.1 // \2. g|) is satisfied, then 

TTkiOjX) a.s. on the event {Nn{x) 00} (2-3) 



-^n,fc|£c 

and 



^ ^ p,{e) a.s. (2.4) 
n 

Here, "A a.s. on B" means that P[B \ j4) = for two events A and B. 
Further, if the density of the covariate is positive at x, then 

r\On^oo Nn[B[X,r)) 

where N^,k\B{a^,r) = Em=i X^,kl{^n, e B{x, r)}, Nn{B{x, r)) = Yl=i e 
B{x,r)}, B{x,r) is a ball with the center x and the radius r. 

Notice, when P(^ = x) = 0, though the allocation proportion Nn^k\x/Xnix) 
is not well-defined, ()2.3p is trivial because P(A'^„(a;) — > 00) = 0. Accurately, 
()2.3p makes sense only in the discrete covariate case and (j2.5p is a version 
of (12. 3D for continuous covarites. 



2.3 Variability and efficiency. 

For response-adaptive designs which do not incorporate covariates, Hu, 
Rosenberger and Zhang (2006) found the lower bound of the asymptotic 
variability of a design, i.e., of the allocation proportions of the design. A 
design is called asymptotically efficient if its asymptotic variability attains 
the lower bound. Next, we study the variability and efficiency of the CARA 
designs. Suppose, given ^, that the response Yk of a trial of treatment k has 
a distribution in the exponential family, and takes the form 

fk{yk\i, Ok) = exp {(yfc/Ufc - ak{l^k))/4>k + hiVk, 4'k)} (2.6) 

with link function pk = hk{£,6l), where Ok = (Bki, ■■■ , Gkd), k = 1, . . . ,K, 
are coefficients. Assume that the scale parameter 4>k is fixed. It is easily 
checked that E[Yk\^] = a'^{fik), Var(yfc|^) = a'l{fik)(t>k, 

d\ogfk{yk\i,Ok) If -s^,, ttaT\t 

= —{yk - akinkHhi^o^ )^, 

oOk cpk 
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d'^ log fk{yk\$, Ok) 1 



and, given ^, the conditional Fisher information matrix is 



d^iogfkiYkltdk 



1 

<pk 



For the observations up to stage n, the hkehhood function is 

n K 

j=l k=l 

K n K 

= \{\{[fk{Y,,k\i3,ek)]''-' ■.= \{Lk{ek) 

k=lj=l k=l 



(2.7) 



with log Lk{6k) oc Xj^k{Yj^k-akifJ'j,k)}, fJ'j,k = hk{Ol^j), k = 1,2, . . . ,K. 

Write 

Ifc = E[7rfc(0,O4(0A:|^)], k = l,...,K. 

Then 



(21 



52 logL(6') 



del 



^ Ee [Xj^khiOkl^j]] = nik + o{ 



nj 



It follows that the entire Firsher information matrix is 

"a^ logL(0)' 



IniO) 



ndiag{Ii, . . . ,Ik) + o{n). 



Thus we obtain the following theorem. 

Theorem 2.2 Suppose the responses follow the generalized linear model 
\2. b]) and the design satisfies Let I{9) = diag{Ii, . . . ,Ik)- Then 

the Firsher information matrix satisfies 

In{e) = nI{e)+o{n), 

and the asymptotic variance- covariance matrix of an asymptotic efficient 
estimator of 6 is I~^{6)/n. 
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The limit proportion p{0) = {pi{6), . . . , pxiO)) depends on both the 
parameter 6 and the distribution of ^. When the distribution of ^ is known, 
according to Theorem 12.21 the asymptotic variance-covariance matrix of 

an asymptotic efficient estimator of p{9) is ^^f^-^^^(^) • While, 

if the parameter 6 is known, then the non-parameter maximal likelihood 
estimator (MLE) of p{0) = E[7v{8,^)] is ^ Ylm=i and its variance- 

covariance matrix is Var{7r(0, ^)}/n. So, in the general case that the param- 
eter 9 and the distribution of ^ are both unknown, the asymptotic variance- 
covariance matrix of an asymptotic efficient estimator of p{9) is B{0)/n, 
where 

= i?|^I-H«)(^)%VarWO,«)}. 

The allocation proportion Nn/n in a adaptive design with property ()2.2|) 
will converge to p{9) according to Theorem 12.11 So we can now define an 
asymptotically efficient CARA design as follows. 

Definition 1 A covaraite- adjusted response- adaptive design with target func- 
tion Tv{9,x) is called asymptotically efficient if it satisfies 112. S\) and 

n'/^{Nn/n-p{9))^N{0,B{9)), (2.9) 

and B{9) is called the best asymptotic variability. 



Zhang, Hu, Cheung and Chan (2007) proposed a CARA design (we refer 
it as ZHCC's design) by defining 

P(^m+l,fc = M'^m,$m) = T^k{(^m, ^m+l), 

where 9m is the MLE of 9 based on the observations up to stage m. It has 
been shown that ZHCC's design satisfy (j2.2p and 

where 

= ^'-^I-HO) {^y + d^ag{pm - {p{9)f p{9). 
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It is easily seen that 

diag{p{e)) - {p{e)fp{e) 

=Var{^(0,O} + E [dm<7(7r(0,C)) - (^(0, C))^ vr(0, ^)] > Var{7r(0,C)}, 

where A > B means that A — S is non-negative definite. Hence, ZHCC's 
design is not asymptotically efficient. 

It is of significance to find an asymptotic efficient CARA design for any 
given target function 1^(6, x). In the next section, we will propose a new 
class of CARA designs with an asymptotic variability being able to approach 
the best one. 



3 Covariate-adjusted DBCD 

Our new design is based on the idea of the doubly adaptive biased coin 
design (BDCD) proposed by Eisele and Woodroofe (1995), and extended 
by Hu and Zhang (2004a). In the scenario without covariatcs, the Hu and 
Zhang's extension can target any desired allocation and can approach the 
lower bound of the asymptotic variability by tuning a parameter. In this 
section, we modify the DBCD to incorporate covariates. For simplification, 
we only consider the two-treatment case {K = 2). 

Covariate-adjusted DBCD (CADBCD): To start, we let Oq be an initial 

estimate of 6, and assign rriQ subjects to each treatment by using a restricted 
randomization. Assume that m {m > 2mo) subjects have been assigned to 
treatments. Their responses {1^-, j = l,...,m} and the corresponding 
covariates {^j, j = l,...,m} are observed. We let 6m = {0m,i,^m,2) be 
an estimate of 6 = {61,62). Here, for each = 1,2, 6m,k = 6m,kiYj,k,^j '■ 
^j,k = l)i = 1) • • • ) "T") is the estimator of 6^ that is based on the observed 
A^m,A:"Size Sample {(i^',fc;^i) ■ for which Xj^^ = l)i = 1 • • • j^}- Write pm = 
^I]iti7ri(0m,^i) and nm = 'n-i{6m,$m+i)- Next, when the (m + l)-th 
subject is ready for randomization and the corresponding covariate ^m+i is 
recorded, we assign the patient to treatment 1 with a probability of 



V'm+l,! = 7 Z XT 7 — : v7 (^-10) 



and to treatment 2 with a probability of V'm+l. 2 = 1 — V'm+l,!) where 7 > 
is a constant that controls the degree of randomness of the procedure, from 
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most random when 7 = to deterministic when 7 = 00. ZHCC's design is 
a special case of CADBCD with 7 = 0. 

Asymptotic properties. For studying the asymptotic properties, we 
assmne the target aUocation function tti{6*,x) satisfies the foUowing condi- 
tion. 

Condition A We assume that the parameter space &k is a bounded domain 
in W^, and that the true value Ok is an interior point of ©fc, k = 1,2. 

1. For each fixed x, Q < 711(0*, x) < 1 is a continuous function of 0* in 
the closure of &i x ©2- 

2. 7ri(0*,^) is twice differentiable with respect to 0* , and the expectations 

of\\dT:i{0,$)/dO\\'^ and sup \\d'^'Ki{0* dO'^\\ are finite for some 

\\0*-e\\<6 

5 > 0. 

Write V = E[7ri(0,^)], then < w < 1 due to Condition A.l. 
Theorem 3.1 Suppose that for k = 1,2, 
1 " 

Onk-Ok = -Y,^n^,khk{Ym,k,U){'^ + o{l)) +o{n-^^^) a.s., (3.11) 

n 

m=l 

where h^s are functions with E[hk{Yk,^)\(,] = 0. We also assume that 
E\\hk{Yk,$)\\'^ < 00, k = 1,2. Then under Condition\^ we have 

P{Xn,l = l)^V; P{Xn,l = l\^n-l,^n=x) ^Tli{0,x)a.S. (3.12) 

and 

Further, letVk = E{Tik{e,i){hk{Yk,i)fhk{Yk,i)], k = l,2,V = diag{Vr,V2) , 
a\ = E[7ri(0,^)(l-7ri(0,O)], = ^^^-.,{0 , = E'-^V{E'-^f, 
^ = ^IJIT^ = T^5r + ^2 + ^3- Then, 

Vn{Nn,i/n-v) ^ N{0,a^) and y/^{dn - 0) ^ N{Q,V). (3.14) 
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The proof of this Theorem is a httle complex and will be state in the 
Appendix. According to (|3.12p . CADBCD satisfies (pr2]) . The asymptotic 



variability cr^ of the design takes the values from the maximum 



when 7 = to the minim o"| + ct? when 



CXD. 



The next result for the generalized linear model is a corollary of Theorem 
The proof is given in the Appendix through the verification of Condition 
(I3TT]) . 

Corollary 3.1 Suppose the distributions of the responses follow the gener- 
alized linear model \2. 6j) and satisfy the following regular condition 



H{5) =: Ee 



sup 

\\z\\<6 



d^\ogfk{Yk\i.ek 



as 6^0, 



(3.15) 



where f{x)\'^ = f{b) — f{a). Under Condition\^ if the matrices Ii and I2 
defined as in Ii2.8\) are nonsingular and the MLE 6„^, which maximize the 
likelihood function 12. 7\ ), is unique, then we have 113. 12\) . 113. 13\} . and i3.14\ ) 
with V = 1-^(0) and 1(0) = diagih.h)- 

It is obvious that B{0) = cr^ + "^i best asymptotic variability 

of CARA designs with two treatments according to Definition [TJ For the 
CADBCD, 



a 



1 + 27 



+ B{e) > B{0) but \ B{e) as 7 



00. 



v{l—v) 



This means that the CADBCD is not asymptotically efficient but it can ap- 
proach an asymptotically efficient CARA design if 7 is chosen large. ZHCC's 
design is a special case of the CADBCD which has the largest variability. 



4 Conclusion Remarks 

We have proposed a family of covariate-adjusted response- adaptive designs 
that are fully randomized and asymptotically efficient. The CADBCD can 
be viewed as a generalization of Hu and Zhang's doubly adaptive biased 
coin design (Hu and Zhang, 2004a) for incorporating covariate information. 
The asymptotic properties derived here provide the theoretical foundation 
for inference based on the CADBCD. 
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In this paper, we have assumed that the responses in each treatment 
group are available without delay. In practice, there is no logistical diffi- 
culty in incorporating delayed responses into the CADBCD, provided that 
some responses become available during the course of the allocation in the 
experiment, and thus we can always update the estimates whenever new 
data become available. For clinical trials with uniform (or exponential) pa- 
tient entry and exponential response times (see Bai, Hu and Rosenberger 
(2002), Hu and Zhang (2004) and Zhang, et al (2006) for examples), it is 
easy to verify the theoretical results in Section 2 and 3. 



5 Appendix: Proofs 

Proof of Theorem [2711 Notice E[Xjn+i,k\'^m] = E[V'm+i,fc|=^m] ^ Pk{0) 
by ^ and {Em=i(^m,fc - E[X„,fe|^„_i]), is a martingale. ^ fol- 
lows immediately. For ([2^3]) . let = o-(^m,^m+i)- Then {Ylm=ii^rn,k - 
E[Xm,kWm-i])I{^m = x] IS a martingale with 

n 

^[{{Xm,k - E[X„,fe|^^_i])/{^„ = X]Y < Nn{x). 

m=l 

It follows that 

— iii— ^ — ^- > a.s. on {Nn{x) 00} 

^^n{X) 

by Theorem 3.3.10 of Stout (1974). On the other hand, 

El=l{^[X.n,k\^m~l] - Me,x))I{U = X} 



Nn{x) 

by ([22]). So, ([23]) is proved. For ([23]), notice 

Nn{B{x,r)) 



a.s. on {Nn{x) 00} 



P{^ G B{x,r)} > a.s. 
11 

With a similar argument we have 

n~*oo Nn{B{x,r)) n->oo Nn{B{x,r)) 

E[7rk{9,m^eB{x,r)}] 



P{$, £ B{x,r)} 



a.s. 
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Letting r \ yields (|2.5 



Proof of Theorem 13.11 The proof is a little complex and long. We will 
complete via four steps. 

Step 1. We show that (fXT3|) and 



+ 0(Y^log log m/m) a.s. 



(5.16) 



Write vri = 7ri(6>,^) for short. Let M„,i = YZ.=i " ^[X^,iY^m-u^n 

Mn,2 = Em=l (7ri(^,^m) " EvTi), Qn,k = Ylm=l Xrn,khk{Y^,k, ^m) for k = 

1,2, Qn = (Qn,i,Qn,2) and Mn,3 = Qn (^^^ ■ Then Q„ and M„j, 
j = 1,2,3, are martingales. According to the law of the iterated logarithm 
(LIL) for martingales, we have 

Qn = 0(\/log logn/n) and M„,j = 0{\/log logn/n) a.s.j = 1, 2, 3. 

(5.17) 

Hence, by (|3.1ip it is easily shown that 



6m — 6 = 0(\/log log m/m)a.s. 



(5.18) 



It follows that 

vfm =7ri(0m,^m+l) = 7ri(0,^m+l) + (^m " ^) 

aVi(r,e 



(0,e, 



m+1 , 



56* 



+ 0(1)||6>™ sup 

||0*-0||<(5 

=7ri(6>,^„+i) + (6>^ - + (fm - 6*) 



m+l I 



, ^.1^fogfog"^ 
+ 0(1) sup 

||6»*-0||<5 



It is easily shown that 



m=l 



ni+l, 



do 



a.s. 



.d-Ki 



(5.19) 

T 



T 



and 



log log m 
sup 



m=l 



ITT- ||0*-0||<5 



o((logn)^) a.s. 



o((logn)^) a.s. 
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It follows that 



J2Tfm=J2 M0,U+i)+J2^em-e) (e^\ +o{i^ognf) a.s. (5.20) 

m=l m=l m=l ' 



Similarly, 



^ m 

m ^-^ 

1=1 

i=l ^ ^ 



m ^ 



i=l 



de 



^ m 

+o{i)\\em-ef-y^ sup 

m ^ W-9\\<S 



de 



(5.21) 



- 1 1 MO, + iO^ - 0) (e^)% 0(te)(5.22) 
It follows that 

Pm = 1'+— Xll^i ^j)-E7ri]+0( Vlog log m/m) = ?;+0(-yiog log m/m) a.s. 
and 



1=1 



tTot = ni; + 0(-\/n log logn) a.s. 



m=l 



Now, write 



g{7r,a,b) 



7r(6/a) 



7r(6/a)7 + (l-7r)((l-6)/(l-a))n 



(5.23) 



Then V'm+i.i = 5 (T^m; ^m,\/fni pm)- It is easily seen that g{'K, a, b) is a non- 
decreasing function of b, and so g{TT,a,b) < g{ir,a,a) = vr if a > b. Let 
In = max{m < n : Nm,i/m < Pm}, then V'm+i.i < T^m when m > + 1. 
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Hence 

n-l 

m=ln+l 
n-l 

<1 + Ar,„,l + M„,i - M;„+i,i + ^rn 

m=ln+l 
n-l In 

<1 + InPln + Mn,l - Mi^+i^i + ^ - XI ^™ (^-2^) 

m=l m=l 

<nv + 0(A/n log logn) a.s. 

Similarly, 

n — Nn,i < n(l — v) + 0{y/nlog logn) a.s. 
(j3.13p and (j5.16p are now proved. 

Step 2. We show p.l2p and the asymptotic normality of On- 

By IKW and (l536|) . Pn/{Nn,i/n) 1 a.s.. And hence (IXT2I1 is proved, 
and further tpm,i — vri(0m-i,^m) a.s. Then, it is easily check that Qn 
is a martingale with 



1 

- X E [(AQ„)'^AQ„] 

m=l 
1 " 

= - X diag(E [V'm.l^lC^^m,!, Im)^^l(^m,l, ^m)] , 
m=l 

So, applying the central limit theorem for martingales yields 

9 



n 



i/2(0^_0)4iV(o,F). 



The proof of Step 2 is completed. 
Step 3. We show that 

) f Nm,l ^ \ doglogm, /r- oc:\ 

Vm+1,1 = TTm - 7 ^ Pm ] + 0{ ) a.S. (5.25) 

v[l — V) \ m J m 
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Let g{TT,a,b) be defined as in ()5.23p . By some elementary argument, it 
can be showed that 



sup 

0<7r<l 



g{Tr, a, 6) - vr + -j— (a - b) 



v{l — v) 

as (a, 6) {v,v). By U^TTSh and ([536]) . it follows that 



0{ia-vf + {b-vf}, (5.26) 



sup 

0<7r<l 



g{-K,Nm,i/m,pn 



vr + 7 



^(l-vr) /7V^,i 



/On 



f (1 — v) \ m 

()5.25p is now proved. 

5iep At last, we show the asymptotic normality of A', 



log logm 
C( ) a.s. 



m 



Notice Nm,i/m — pm = ©(y^log log m/m) a.s.. With the same argument 
as deriving (j5.20p . we can show that 



E 



(1 - TTm) ( Nm,l 

v{l - v) 



m 



E 



E[^l(l-7ri)] (NmA 



u(l - v) 
By (fOSi) it follows that 



Pm. + o{{\ognf) a.s. 



m=l m=l m=0 



n-1 



A ^ f ^^^^^ -Pm]+ o{{\ognf) a.s. 

m=l ^ 
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Then 

n 

Nn,l -nv = Mn,l + ^ V'm-l,! - 



nv 

m=l 

n-1 



m=l ^ ^ 

=M„,i + M„,, + A g ^ + (A + 1) E (^n. - e) (e^) 

m=l m=0 ^ ^ 

m=l ^ ^ 

=M„,i + M„,2 + A E + (1 + o(l)) (A + 1) E 



m / \ ■'^ — ' m 

Tn=l / \ m=l 



m=l 

On the other hand, 

E[AMm,iAMm,j\^m-l] = 0, i ^ j, 
E[(AM^,i)2|^^_i] = E[V'™,i(l - V'm.Ol^m-l] ^ a? a.s., 
E[iAMm,2f\^m-i] = yar{7r,{e,U)] = 4 

and 

By applying the function central limit theorem (c.f., Corollary 3.1 of Hall 
and Heyde, 1980), we have 



n 



-1/2 (M[„,p,M[„,],2,M[„,],3) 4 (a^B\'\a2B^^\a^Bf) , 



where B^' , i = 1,2,3, are three independent standard Brownian motions. 
Then with the same argument as in Hu and Zhang (2004a), one can show 
that 

n-'/\N^nt],i - [nt\v) ^ Gu 
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where 



Jo Jo 



is a solution of the equation 



(2) 



with Go = 0. It is easily checked that 



. / -^dx +(A+1)CJ3 / 

Jo X J Jo ^ 



dx — X I — -dx 



Var(Gt) = t 
Hence 



2(A + 1) 



1 + 2A ^ 1 + 2A ^ 



; h <7n + (To 

1 + 2A 2 ^ 



n 



'/'(iVn,i/n-t;)^iV(0,a2). 



Proof of Corollary 13.11 It is sufficient to show the strong continency of 
the MLE Om-. 

On ^ e. (5.27) 



In fact, if (fOT]) is proved, then by (f5l9]l and ([5:21]) we have — > v a.s. 
and ^ Ylm=i — > V a.s.. By (j5.24p we will have Nn/n — > v a.s. It follows 
that ^prn,k — T^ki^m-i, a.s. by (|5.26p . The rest proof is similar to 

Corollary 3.1 of Zhang et al (2007). 

For (|5.27|) . it suffices to show that, for any 6 > small enough, with 
probability one for m large enough we have 



logLkiei) < logLkiOk), if ll^fc - dkW = 5. 



(5.28) 



We consider the case k = 1 only. The application of Taylor's theorem yields 



\ogLi{ei 



1 



ni 



logLi(0i) 



1 91ogLi 



+ {ei-e 



m dOi 
1 



){^ 



f 


r a^iogLi 


6»i+t(e*-0i) 


Jo 


[ del 


01 



dt 



}m-0i 
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Write 



^1(^,0 (!)V(l-^i(z,0) (iff 



7 ■ 



It is obvious that / is a continuous function of a, b and z for each give ^. 
By applying the law of large numbers for martingales, one can show that 



1 91ogLi 



m dOi 



a.s. 



and 

9^ log Lj 



del 



i=2 



— ,b=Pj_l,z=0j_i 



+o(m) a.s. 



For the details of the proof, one can refer to Zhang et al (2007). Further, it 
is obvious that 



^ m 

limsupp^m < liixL — > sup7ri(0,^j) = E[sup7ri(0, ^)] < 1 a.s., 

e e 

where the superior is taken over the parameter space. And similarly 

1 " ^ 

limsup— vfm < E[sup7ri(0, ^)] < 1 a.s. 

m=l 

By O, 

limsup iVn^i/n < E[sup7ri(0,^)] < 1 a.s. 


By considering 1 — pm and n — A''„^i instead of pm and Nn^i respectively, we 
have 

liminf > E[inf 7ri(0, ^)] > and liminf iV„_i/n > E[inf 7ri(0, ^)] > a.s. 


So we may assume that PmiNn^i/n G [5q, 1 — 5q\ for some < (5o < 1. On 
the other hand, it is obvious that yE[/(a, 6, z, is a continuous 

function of a, 6, y, z, and is positive for all < a, & < 1, y 7^ and all z. It 
follows that there is a constant cq > for which 



hminf min (yE[/(a, 6, z, y^) 



j^oo y:\\y\\=l 



"j-i- 



=Pj_i,z=6>^_i 



> Co a.s. 
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So with probability one for m large enough it holds that 

-logLi(0^)-llogLi(0i) 
m m 

^ m 

~f,y-\\y\\=^ " 

+ \\ei-eifH{\\ei-ei\\) + o{i) 

< - co5^ + 6'^H{6) + o(l) < uniformly in 61 with - Oi 
when 6 is small enough. (|5.28p is proved. 
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